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Abstract —In our prior work, we outlined an approach, 
named DisCoF, for cooperative pathfinding in distributed sys¬ 
tems with limited sensing and communication range. Contrast¬ 
ing to prior works on cooperative pathfinding with completeness 
guarantees, which often assume the access to global informa¬ 
tion, DisCoF does not make this assumption. The implication 
is that at any given time in DisCoF, the robots may not all 
be aware of each other, which is often the case in distributed 
systems. As a result, DisCoF represents an inherently online 
approach since coordination can only be realized in an oppor¬ 
tunistic manner between robots that are within each other’s 
sensing and communication range. However, there are a few 
assumptions made in DisCoF to facilitate a formal analysis, 
which must be removed to work with distributed multi-robot 
platforms. In this paper, we present DisCoF" 1 ", which extends 
DisCoF by enabling an asynchronous solution, as well as 
providing flexible decoupling between robots for performance 
improvement. We also extend the formal results of DisCoF 
to DisCoF" 1 ". Furthermore, we evaluate our implementation 
of DisCoF + and demonstrate a simulation of it running in 
a distributed multi-robot environment. Finally, we compare 
DisCoF" 1 " with DisCoF in terms of plan quality and planning 
performance. 

I. INTRODUCTION 

While cooperative pathfinding in multi-robot systems has 
many applications, it is also fundamentally hard to solve 
(i.e., PSPACE-hard [6]). The difficulty lies in the poten¬ 
tial coupling between robots: when robots are completely 
decoupled (e.g., when robots do not impose constraints 
on each other’s plan to the goal), cooperative pathfinding 
becomes polynomial-time solvable]^] As a result, most recent 
approaches (e.g., [14], [15], [16], [17]) for pathfinding con¬ 
centrate on how to identify the dependencies between robots, 
in order to couple robots only when necessary to achieve 
computational efficiency for many problem instances. 

In these approaches, the solution is constructed for a 
subset of robots (i.e., robots that must be coupled) at any 
time, which are assumed to be decoupled with the remaining 
robots. The computational complexity is exponential only in 
the maximum number of robots in these subsets. While opti¬ 
mistic decoupling can lose optimality and even completeness 
(e.g., [15]), pessimistic decoupling can only handle situations 
in which robots are loosely coupled (e.g., [16]). 
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Meanwhile, to ensure completeness, these approaches of¬ 
ten assume access to global information, which includes 
knowledge about the current positions of the robots, and the 
robots’ individual plans to their respective goals. With this 
information, any robot can consider all other robots when 
creating its own plan. While this assumption can be made 
in many common applications of cooperative pathfinding 
where planning can be centralized and performed offline 
(e.g., cooperative pathfinding in computer games), it does 
not hold in distributed systems due to limited sensing and 
communication range. 

In our prior work [21], we introduced a window-based 
approach, called DisCoF, for cooperative pathfinding in 
distributed systems with limited sensing and communication 
range. In DisCoF, the window size corresponds to the sensing 
range of the robots. Robots can communicate with each other 
either directly or indirectly. If they are within sensing range, 
then robots may communicate directly. However, if the robots 
are out of sensing range it is still possible to communicate 
indirectly through other robots using a communication relay 
protocol. This allows for coordination beyond a single robot’s 
sensor range. 

To ensure completeness, DisCoF uses a flexible approach 
to decoupling robots such that they can transition from 
optimistic to pessimistic decoupling when necessary. Robots 
are assumed to be fully decoupled initially. During the online 
pathfinding process, robots only couple together when nec¬ 
essary (i.e., when there are predictable conflicts [21]). Since 
access to global information is not assumed, the creation of 
local couplings (i.e., subsets of robots) may not be sufficient 
due to the danger of live-locks. In such cases, a mechanism 
(called push and pull) is introduced in which robots in a 
local coupling can form a coupling group [21] in order to 
coordinate more closely. Robots in a coupling group move 
to their goals sequentially in a certain order while keeping 
others (i.e., those that have not yet reached their goals) within 
communication range. Coupling groups may increase in size 
(e.g., when previously undetected robots come within sensing 
range of a robot in the coupling group) and decrease in size 
(e.g., when robots reach their goals). This mechanism can 
potentially lead to a global coupling. 

Contributions: In this paper, we introduce an asyn¬ 
chronous variant of DisCoF, refereed to as DisCoF" 1 ", in 
order to remove DisCoF’s required assumption that time 
steps are synchronized. The Major difference of DisCoF" 1 " 
from DisCoF is that this one runs on the individual robot, 
but the previous one runs on a group of robots, depending 
on the synchronized time step for the entire robots. In order 


to make it work, we provide an asynchronous algorithm 
with its communication strategy. Then, we introduce a new 
decoupling strategy in DisCoF + that allows robots to transi¬ 
tion between optimistic and pessimistic decoupling with the 
goal of improving efficiency. Furthermore, we demonstrate a 
simulation of DisCoF" 1 " in a distributed multirobot environ¬ 
ment modelled in Webots in addition to providing the results 
of numerical experiments with which the performance of 
DisCoF and DisCoF + are compared in terms of computation 
time and length of plans. 

II. RELATED WORK 

To address the cooperative pathfinding problem, re¬ 
searchers have used a compilation approach [8], [1], [5], 
[20], in which the problem is first transformed into other 
related problems, and then the existing solutions or al¬ 
gorithms for these problems can be applied. Abstraction 
methods to reduce the search space have also been used 
[18], [13]. However, due to the inherent complexity of the 
problem, these approaches are unscalable. While approaches 
that constrain the topologies of the environment [19], [11], 
[12] can significantly reduce the complexity, they cannot be 
applied to general problem instances. 

Given that pathfinding for a single robot is polynomial¬ 
time solvable, it is clear that the complexity is a result of 
coupling between robots. As a result, researchers have con¬ 
centrated on various ways to decouple robots. For approaches 
that perform optimistic decoupling, robots are considered 
as coupled only when necessary. One of the representative 
approaches is hierarchical cooperative A* (HCA* [15]), in 
which robots plan one at a time while respecting plans 
that have already been calculated. To limit the influence of 
the previous robots on the following robots, a windowed 
HCA* is introduced to restrict this influence based on a 
pre-specified window size [15], Recently, an extension of 
WHCA* (CO-WHCA* [2]) is introduced to further reduce 
this influence based on the notion of conflicts. Although 
many problem instances can be solved efficiently, optimistic 
decoupling in these approaches leads to the loss of optimality 
and completeness. 

One of the earlier approaches that performs decoupling 
while maintaining optimality and completeness relies on 
pessimistic decoupling [16], which couples robots when 
any conflicts are detected in their entire individual plans. 
As a result, this approach tends to over-couple and hence 
remains intractable for many problem instances. More recent 
approaches relax optimality to achieve better efficiency [9], 
[4], [17]. However, to maintain completeness, these ap¬ 
proaches assume access to global information and therefore 
are inapplicable to distributed systems in which robots have 
limited sensing and communication range. 

While there are extensible approaches to distributed sys¬ 
tems (e.g., [7]) and approaches that are designed for dis¬ 
tributed systems (e.g., [10], [3]), they do not provide com¬ 
pleteness guarantees. The difficulty lies in planning without 
access to global information, which is addressed in [21], 


III. DisCoF 

In this section, we provide the problem formulation and 
review DisCoF [21]. Extensions to DisCoF (i.e., DisCoF + ) 
are discussed in Section HVl 

A. Problem Formulation 

Given an undirected graph G( V. E), and a set of robots 1Z, 
the initial locations of the robots are I C V, and the goals 
are Q C V. Any robot can move to any adjacent vertex in 
one time step or remain where they are. A plan V is a set 
of individual plans of robots, and V [*] denotes the individual 
plan for robot i GTZ. Each individual plan is composed of a 
sequence of actions. For simplicity, in this paper each action 
is represented by the next vertex to be visited. For example, 
Vk[i\ (k > 1) denotes the action to be taken at time step 
k — 1 (or the vertex to be visited at k) for robot i. V k fli\ 
(k < l ) denotes the subplan that contains the actions from 
Vk [i] to 'Pi [■/']. The goal of cooperative pathfinding is to find 
a plan V, such that robots start in I and end in Q without any 
conflicts (defined below). The location of robots at time step 
k is denoted by S k , and the location of robots after executing 
plan V from location S is denoted by S('P). Hence, So = I, 
Sq(V) = G, and Sk = So(Vi, k ). A conflict happens at time 
step k, if the following is satisfied: 

Sk[i\ = S k [j] V (S k \i] = 5 fe _i[j] A 5 fc _i[i] = S k [j}) (1) 

in which i £ 1Z, j £ TZ and i j. If two robots move to 
the same place at time k, the first condition holds. If two 
robots switch their locations in two consecutive time steps 
from fc— 1 to k, the second condition holds. Figure [I] shows 
the first condition. 

Each robot has a planner that can compute a shortest path, 
P(u, v ), that moves a robot from vertex u to v. The length of 
P{u,v) is denoted as C(u,v), i.e., C(u,v) = \P(u,v)\. The 
following simplifying assumptions are also made in DisCoF: 

1) Robots are homogeneous and equipped with a commu¬ 
nication protocol for message relay. 

2) Robots know G and are synchronized at every time step. 

Initially, for each robot i , the individual plan is constructed 

as V[i] = Q [*]). Robots then start executing their 

individual plans until conflicts can be predicted (i.e., pre¬ 
dictable conflicts in [21]) at time step k. In such cases, the 
individual plans of robots that are involved are updated from 
■Pfc+i to avoid these conflicts. 

B. Optimistic Decoupling 

In DisCoF, the window size corresponds to the sensing 
range of the robot. To reduce communication, a robot is 
allowed to communicate with other robots when it can sense 
them. However, robots that cannot sense each other can 
communicate using the message relay protocol through other 
robots. A closure of the set of robots that can communicate 
(directly or via message relay) in order to coordinate is called 
an outer closure (OC). In an OC, there can be multiple 
predictable conflicts. A closure that contains agents with 
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Fig. 1. [21]: Scenario that illustrates OC and IC. Two OCs are present { 7*1 } 
and {r' 2 . r.j }, out of which one has an IC { 7 * 3 , r,\ \ with a predictable 
conflict. The sensing ranges of the robots are shown in gray. The arrows 
show the next few steps in the individual plans. 


potential conflicts is the inner closure (IC) of the OC. Figure 
[T] shows an example of OC and IC. For details, refer to [21]. 

In DisCoF, decoupling is optimistic initially, and gradually 
becomes more pessimistic during the online planing process 
when necessary. Given an OC with predictable conflicts, in 
optimistic decoupling DisCoF updates the individual plans of 
robots to proactively resolve these conflicts, while avoiding 
introducing new conflicts within a finite horizon (which is 
specified by a parameter in DisCoF). The finite horizon is key 
to efficiency since the resolution for conflicts in the far future 
is likely to waste computation efforts given the incomplete 
information (e.g., other robots in the environment). Note that 
the window size (i.e., sensing range) in DisCoF represents a 
horizon for detecting conflicts. 

To ensure that robots are jointly making progress towards 
their goals, DisCoF uses the notion of contribution value. 
In order to resolve conflicts, plans are updated in a process 
known as conflict resolution. In this process, each robot is 
associated with a contribution value when using optimistic 
decoupling. If this process is successful, robots continue 
as fully decoupled. The contribution value is also used to 
determine cases when optimistic decoupling is insufficient, 
in which the resolution process would fail due to potential 
live-locks. When there are no potential live-locks, it is shown 
that optimistic decoupling is sufficient for robots to converge 
to their goals. Otherwise, robots within the OC use the 
following pessimistic decoupling process. 

C. Pessimistic Decoupling 

In DisCoF, when there are potential live-locks, robots 
within an OC transition to pessimistic decoupling by remain¬ 
ing within each other’s communication range (whether direct 
or indirect). These robots are referred to as a coupling group, 
and this coupling group executes a process known as push 
and pull, which allows it to merge with other groups and 
robots. Thus, the level of coupling gradually increases. In 
this way, DisCoF can naturally transition robots to be fully 
coupled when necessary. 

In push and pull, robots move to goals one at a time 
according to the priorities of subproblems (first introduced 


in [4]). However, due to the incompleteness of information 
in distributed systems, the priorities will not be fully known. 
As a result, DisCoF employs the following process. At time 
step k, for each coupling group that has been formed, DisCoF 
will: 

1) Maintain robots in the group within each other’s com¬ 
munication range; 

2) Move robots to goals one at a time based on a relaxed 
version of the priority ordering, which is consistent to 
that in [4]; 

3) Add other robots or merge with other groups that 
introduce potential conflicts with robots in the current 
group as they move to their goals. 

Unless there are potential conflicts, each coupling group 
progresses independently of other robots and coupling 
groups. These processes are described in Alg. |T| 


Algorithm 1 Pessimistic Decoupling in DisCoF for a cou¬ 
pling group oj, given the current time step k , the environment 
G, current locations S and goal locations Q 
1 : r 4— _L > Initialize the leader robot r 

2 : while 3i G oj s.t. Sk[i] 7 ^ G[i\ do 
3 : (ip, <p) 4- SenseConflict(P, OJ, S, fc, W) 

4 : if ip = 0 then 

5: k 4 — k + 1 > Increase the time step by 1 

6 : G'<^(G,u,S,G) 

7: ( S , P, W) 4- PROCEEDONESTEP^, P, k) 

8: else 

9 : oj 4— oj U ip i > Merge conflict robots with oj 

10: (/, D) 4- AssignAgentsToSubP(G, oj, S, Q) 

11: H 4- computePriority(G, oj, f, D, S, G) 

12: rt-i 

13 : if r = _L V Sk[r] = G[r] then 

14 : r 4- RemoveFromQueue(P) 

15: G' 4- (G,U),S, G) 

16: P' 4- PUSHANDPULL(G', r) 

17: P 4— P[0 : k\ + P'[:] > Update a set of plans for oj 


Alg. □ continues until all members in a coupling group oj 
reach their final goals. The termination condition is checked 
in line [2] As long as there exists a robot that has not reached 
its goal, the algorithm will continue with push and pull. In 
Mg- [3 r represents the leader of the group oj. We remark that 
there can be cases in which a robot that has already reached 
its goal may block the path of the leader r. In this case, push 
and pull will swap or rotate (similar to the operators in [4]) 
robots that have not reached their goals with this blocking 
robot in order to progress. Push and pull also ensures that 
this blocking robot moves back to its goal afterwards. 

In [21], we proved that the combination of optimistic and 
pessimistic decoupling in DisCoF guarantees completeness^] 

2 DisCoF is complete for the class of cooperative pathfinding problems 
in which there are two or more unoccupied vertices in each connected 
component, which is an extension of results in [4]. 






















IV. DISCOF+ 

In this section, we discuss the extensions to DisCoF that 
are made in a new approach named DisCoF + . First, we relax 
the assumption that robots synchronize at every time step 
(or plan step). Note that even though robots in different OCs 
cannot communicate in DisCoF, it is assumed that robots 
act in synchronized time steps (i.e., robots are given a fixed 
amount of time to finish planning and execute a single action 
at every time step). The relaxation of this synchronization is 
necessary for implementation with real distributed systems, 
since we cannot always assume the existence of a global 
clock and a fixed amount of time for each time step (e.g., the 
time required for planning for each robot may be arbitrarily 
different). We remark that each robot can still access the 
entire map. We can assume that this information is static such 
that it is initially given and does not change at all. However, 
each robot cannot recognize where other robots are if they 
are out of (indirect) communication and sensing range. This 
information is dynamic such that it changes arbitrarily. 

Furthermore, we introduce a new decoupling strategy 
such that robots are also allowed to decouple after they 
form a coupling group (i.e., executing push and pull), thus 
transitioning back to optimistic decoupling from pessimistic 
decoupling. This strategy is expected to make DisCoF + 
more computationally efficient while achieving higher qual¬ 
ity plans that require fewer steps. 

A. Asynchronous Time Steps 

Unlike DisCoF, DisCoF + allows robots in different OCs to 
proceed in dependently and asynchronously. However, robots 
within the same OC are assumed to still have synchronized 
plan steps. This is a reasonable assumption because these 
robots communicate to coordinate with each other. As a 
result of this assumption, robots who finish their current plan 
step must wait until all others in the OC also finish theirs. 
Afterwards, all members of the group start the next plan 
step at the same time in order to avoid unnecessary conflicts. 
We remark that since we assume homogeneous robots, the 
waiting time at each time step is not significant]^] 

We will explain the difference between DisCoF + algo¬ 
rithm described in Alg. [ 2 ] and DisCoF. In DisCoF"*", each 
robot i £ TZ runs the algorithm in Alg. [2] In line [ 6 ] if 
there is no conflict sensed in the current location 5[z] and 
local window W (i.e., a fixed region around .S'['/']) of robot 
i, such that the IC ip is empty, robot i can proceed one 
step (ProceedOneStep) forward in its plan P. Afterwards, 
robot i continues to the next iteration. On the other hand, if a 
conflict is sensed such that the IC if) is not empty, robot i tries 
to resolve the conflict after checking if it is already involved 
in any conflicts at line [14] If it is not involved in any conflict 
(i.e., it was executing its plan independently)]^] it forms a 
local coupling ui. It first tries to decouple optimistically 

5 Heterogeneous robots may have difference in speed, sensing & com¬ 
munication range and each robot’s size, etc. Considering these issues and 
resolving them are beyond this paper. 

4 The process when robot i is already involved in a conflict is more 
involved. Refer to Alg. [2] for details. 


Algorithm 2 DisCoF + with asynchronous time steps for a 
robot i £ TZ, given the environment G, its initial location /, 
final destination F and initial plan P from I to P; 7 denotes 
the contributions values_ 

1 : (ip,<p,u>, S[:], (?[:], 7[:], fc ) (0, 0,0,0, 0, 0, 0) 

2 : (S\i],G{i]}^(I,F) 

3 : G'<-{G,u,S,g) 

4 : ( S , P , W) <r- ProceedOneStep(G", P, i, k) 

5 : while True do 

6: (ip, <p) <- SenseConflict(P, i, S, k, W) 

7 : if ip = 0 then 

8 : k f— k + 1 > Increase the time step k by 1 

9 : G' <r- (G,w,S,Q) 

10: (S, P, W) <- ProceedOneStep(G", P, i, k) 

li: G' <r- (G, ui,S, Q) > Update G 1 with new S 

12: (7, co, P) <- RecomputeCont(G', P, i, k, 7) 

13 : else 

14 : if uj ^ 0 then 

15 : ueuU^i > Merge w with OC <p 

16: G’ <- (G,u,S,g) 

17 : P' -1- PUSHANDPULL^', Z, 7) 

18: else 

19 : to i — ip > Set w to IC ip 

20: G’ <- (G,U,S,g) 

21: P’ «- Convergence^', z,fc, <p, P, W,7) 

22: if |P'| =0 then 

23 : ui <r- <p > Set uj to OC <p 

24 : G'^<G, U,S,g) 

25 : P' «- PushAndPull(G', i, 7) 

26 : P P[0 : k\ + P' [:] 


through Convergence. If it cannot find a plan P', then 
it decouples pessimistically through PushAndPull. After 
finding a plan P', it continues to the next iteration to sense 
if there are new conflicts. We remark that our description of 
PushAndPull in Alg. [2] is simplified to show the overall 
process. Once PushAndPull returns a new plan P' in Alg. 
[ 2 ] it contains the individual plan for robot i to move from 
its location at the time step k to its goal. 

Correctness: For Alg. [2] we need to show that whenever 
there is a conflict, it always returns a valid plan. If there is 
a conflict, in line [14] robot i checks if it is already involved 
in a conflict (with ui). If ui 7 ^ 0 (i.e., it is already involved 
in a conflict), we merge the OC (i.e., <p in Alg. [2] with uj, 
and then call PushAndPull for i. In line [22] if P' is not 
empty, it means that CONVERGENCE returns a new plan P'. 
If P' is empty, then robot i calls PushAndPull. In both 
cases, the returned plan P' is either from CONVERGENCE 
or PushAndPull. We have shown that Convergence or 
PushAndPull always returns a valid plan in [21]. 

We remark that ProceedOneStep always results in the 
robot proceeding one step forward in its plan. If robot i has 
already reached its final goal (while there are robots that still 
need to reach their goals), proceeding one step in this case 
simply adds a step for robot i to stay. However, note that 







when robot i blocks other robots after reaching its goal, its 
plan can be updated by these other robots (i.e., forcing robot 
i to move off its goal temporarily). 

B. Communication and Leader Selection 

There are two major cases in which robots communicate 
with each other in DisCoF + . One is to detect predictable 
conflicts, and another is to synchronize planning and plan 
execution within the same OC. Given a robot i £ 1Z, de¬ 
tecting predictable conflicts, performed by SenseConflict, 
requires the following steps: 

1) Check nearby environment (i.e., W) through a sensor 
for other robots (e.g., a laser sensor); 

2) Compute the OC 0 of robot i; 

3) Communicate with robots in 0 to obtain their plans, 
then check if predictable conflicts exist among them; 

In the above process, the first step does not require any 
communication between robots; it only depends on sensors. 
Since robots know the environment (i.e., G), they can easily 
detect when there are moving robots nearby using range 
sensors. The second step requires to use the message relay 
protocol to compute the OC 0 . In the third step, once robot i 
obtains all the plans of robots in 0, it can check these plans 
against its own plan for predictable conflicts (from its current 
time step to the next /3 steps [21]). If conflicts are found with 
robot i’s plan, it forms a IC 0 with the conflicting robots, 
and then it communicates this back to the robots in the IC 
0 . Furthermore, while creating a new plan for robots in IC 
0 (i.e.. Convergence), this plan must respect the plans 
of other robots in the OC 0 of this IC. When such a plan 
cannot be found, the set of conflicting robots (0 initially) is 
expanded to include other robots in 0 (which are not initially 
in 0). 

Example 1 (Sensing Conflicts): Consider the scenario in 
Fig. □ In this scenario, assume that robot r 4 is robot i 
in the above procedure, so r t tries to sense a predictable 
conflict. V 4 first senses its nearby environment for other 
robots. In Fig. [T] the local window or the sensing range 
of r\ (denoted by VVj is shown as the gray region marked 
with r 4, and r,\ will detect 7 - 3 . r4 then computes the OC 
0 as {r 4 ,r 3 ,r 2 }. Since r 2 is not r 4 ’s local window, r 3 will 
relay the communication between r2 and r,\. Once 77 obtains 
both r 2 and r 3 ’s plans, it will check their plans against 
its owns plan for predictable conflicts. In this scenario, r,\ 
will recognize a predictable conflict with r 3 , which can be 
addressed using CONVERGENCE. A 

In the above procedure, the leader who computes the 
new plan is the robot who first detects the conflict. Next, 
the leader tries to resolve the conflict in the IC 0 with 
Convergence. If it cannot find a new set of plan (i.e., P' 
in Alg. [2]) through CONVERGENCE, it will continue through 
PushAndPull with the OC 0 for the IC 0 . In such cases, 
we need to choose a new leader (i.e., the robot that moves to 
its goal first), which is based on the priorities of subproblems. 

The second major case for communication is for synchro¬ 
nized planning and plan execution in an OC. This is achieved 
by ProceedOneStep. Note that robots in different OCs 


proceed independently and asynchronously. Since planning 
and plan execution are synchronized within an OC, it guar¬ 
antees that no collision can occur among robots in the OC. 
In ProceedOneStep, each robot asks if other robots have 
finished the execution of their current plan step. Once every 
robot has finished its current plan step, robots can proceed to 
the next step. This requires robots to delay executing the next 
step until they receive responses from other robots. However, 
When robots move out of the communication range, they do 
not synchronize their plan steps anymore. 

C. Flexible Decoupling 

Flexible decoupling is achieved with the help of contri¬ 
bution values. Contribution values are assigned in DisCoF 
to each robot in the CONVERGENCE process (in optimistic 
decoupling), in which the robots must compute an update 
to the current plan to avoid potential conflicts. Contribution 
values are introduced in DisCoF to ensure that robots are 
jointly making progress to their goals. In DisCoF, when the 
Convergence process fails, robots should be in a coupling 
group, running on the plan computed by PushAndPull 
until they reach their goals. In DisCoF + , however, robots 
that are executing PushAndPull can again decouple by 
checking whether certain conditions involving the contribu¬ 
tion values hold. 

Next, we discuss the new decoupling strategy in DisCoF + , 
which is illustrated in the following example. Suppose that 
a conflict is predicted between two robots. Then, an IC 0 
(initially including only the two robots) can be formed and 
there is an associated OC 0 for 0. During the CONVER¬ 
GENCE process, when the leader of 0 makes a new plan, 
the set of conflicting robots can gradually expand (until 
becoming 0) if the leader cannot find a new plan that avoids 
the conflict with the current set of conflicting robots, which 
is initially 0. When a new plan is found, DisCoF + associates 
each robot with a contribution value 7, which captures the 
individual contribution of the robot to the summation of 
shortest distances from all robots’ current locations to their 
goal locations. 

At the very beginning of a problem instance, the contri¬ 
bution value 7 is initialized to be 0 for all robots. Given a 
predictable conflict at time step k, a set of conflicting robots 
0 and a set of current locations S k for 0, a set of goal 
locations Q, the new plan Q (where \Q\ < ft £ N) should 
satisfy the followings: 

^2c(s k [i\,g[i\) +7fc-W >'52 c ( s k[i](Q[i]),G[i\) (2) 

ie4> 

V* e 0 , (3) 

where 7 k-[i] is the contribution value that is associated 
with robot i at the time step k, A], is a Boolean variable 
representing whether there is a conflict which is computed 
based on the updated individual plans and S0 [i] (Q [i]) is the 
local goal for i £ 0. 

We remark that while k in Eq. ([3]) is a constant in DisCoF, 
in DisCoF + , k represents the synchronized current time step 



for the group of robots within 0, which can be different 
from different OCs. However, note that planning and plan 
execution are synchronized within 0 until one of robots 
reaches its goal location. If it reaches its goal location, then 
it is removed from the group, not being maintained within 
the communication range anymore. 

An interesting point of Eq. {3]i is that the new plan Q may 
not satisfy Eq. Q during the execution of Q, as long as Eq. 
0 is satisfied after Q has completed. Q basically specifies a 
local goal for the robots to reach prior to resuming following 
their original shortest-path plans again. Then, the potential 
conflicts are avoided in the process. Given a predictable 
conflict at the current time step and a computed Q , the 
contribution value 7 while executing the actions in Q is 
updated for robot i in 0 as follows: 

7fc+<5[*] = C{S k [i\{Q[i]),G[i]) - C(S k+ s[i\,G[i]) (4) 

where 0 < S < |Q|. We remark that S is a relative time 
step after the robots have formed an OC. For all robots in a 
group, 6 is the same. This update continues until the robot 
become involved in other conflicts or the value becomes 0. 

In DisCoF [21], the contribution value 7 is only used 
for the Convergence process, and robots do not update 
their contribution values when a coupling group is formed 
and robots start PushAndPull. This can lead to inefficient 
behaviors, e.g., when the leader’s goal location is located 
opposite to where the others’ goals are located. This 
situation is illustrated in the following example. 

Example 2 (Narrow Corridor): Figure [2] shows an exam¬ 
ple of robot /'2 in a narrow corridor meeting with a coupling 
group { 77 , 7 * 3 } (executing PushAndPull) moving in the 
opposite direction. The coupling group {7*1, r : >} started in 
the middle corridor, and then 77 became the leader. While 
77 pushes 7"3 to clear away of its path to its goal location 
g 1 , it meets 77 . In this case, they will be merged together. 
Suppose that 77 is chosen to be the leader of the new group 
{ 77 , 77 , 7 - 3 }. Until 77 reaches its goal location < 37 , 7*2 and r 3 
will be pushed to the end of the middle corridor and then 
they will be pulled after the intersection 7 . A 

In DisCoF, the only way to reduce the size of a coupling 
group is to have the current leader reach its goal. Then, a 
new leader will be selected and the remaining robots will 
follow the new leader to its goal. This is clearly an inefficient 
solution. In DisCoF + , we use the contribution values 7 
also in PushAndPull, such that robots can decouple even 
before the leader reaches its goal. 

Next, we discuss how the contribution values can be used 
in the PushAndPull process. More specifically, we pro¬ 
vide a decoupling condition for a coupling group to check, 
which determines when the robots in the group can decouple 
while executing the PushAndPull process. Suppose that 
there is a coupling group uj. After uj computes a new plan 
P' (in PushAndPull), each robot in uj will progress using 
the plan. During this execution, robots continue recomputing 
their contribution values 7 as in Eq. 0- At any step, if the 



Fig. 2. Yellow circles are robots and red circles are their goal locations. 
Blue dashed square represents a coupling group of robot n and r%. This 
group meets another robot 7*2 moving in the opposite direction. Gray cells 
represent the intersections of corridors. 


following condition holds, then the group can be decoupled: 

^C(5 , fc [*],0[7]) +7fe-[*] >'^2C(S k+s [i],G[i\) (5) 

where k is the time step when PushAndPull starts plan¬ 
ning and k + 6 is the current time step such that 0 < 6 £ N. 
C(Sf..[i]. G[i\) is the length of the shortest-path from location 
Sfc[z] (where conflicts were predicted) to the goal location 
G[i] for each robot i £ uj. 7 k-[i\ is the contribution value 
that robot i £ uj had before the conflicts were predicted. 
C(Sk+s[i],G[i]) is the length of the shortest-path from the 
current location S 0 +( 5 [i] to the goal location Q[i] for each 
robot i £ uj. 

Intuitively, Eq. (|5]» is the condition when the summation of 
the length of the shortest-path from robots’ current locations 
to their goal locations is less than the summation of the 
length of the shortest-path from their original coupling 
locations to their goal locations plus their contribution values 
just before forming the coupling group. 

In Alg. [ 2 ] Eq. ([ 5 } is checked inside of RecomputeCont. 
If the condition holds, then the algorithm returns a new plan 
P (i.e., a shorted-path plan) with an empty coupling group uj. 
Otherwise, the algorithm returns the current plan P without 
changing the coupling group uj. 

Example 3 (Decoupling): In Fig. [2] when the coupling 
group { 77 , 7 - 3 } is merged with 77 , then conflict locations 
for { 77 , 7*2, 7 - 3 } and the contribution values (i.e., 7 ) are saved. 
For a simple illustration, assume that 7 = 0. Then, whenever 
the merged group of robots { 77 , 7 * 2 , 7 - 3 } proceed one time 
step in their plan (which is returned by PushAndPull), 
they also check the decoupling condition in Eq.Q in Re- 
COMPUTECont. However, until the leader 77 reaches its 
goal location g-\ , they cannot be decoupled. This is because 
the summation of the distance between robots’ locations to 
their goal locations keeps increasing. When 77 reaches its 
goal location gi , 77 is removed from the group. Assume that 
77 is elected as a new leader of the group. Then, 7*3 will 
be pulled until they reach the conflict location where they 
met previously. (See the place where they are placed in the 






































Fig. [2]) After passing the conflict location, r 3 and r 2 can 
be decoupled since Eq. ([5]) holds. Consequently, from the 
intersection I 2 , r 2 and r 3 can move independently to their 
goal locations. A 

When a coupling group is decoupled and it immediately 
predicts a conflict in the next iteration, it uses the conflict 
resolution process through CONVERGENCE, just as when 
fully decoupled robots have predicted conflicts. Even though 
we discussed the correctness of DisCoF + (Alg. [ 2 ]), we also 
need to show that this new decoupling strategy is not subject 
to live-locks (i.e., robots are always making joint progress 
to the goals). 

Theorem 4.1: The decoupling condition in Eq.([5} ensures 
that robots in the group gradually progress to their final goals. 

Proof: From Eq. |2} and Eq. <|4}, we know that each 
robot in the group gradually moves towards its final goal. 
Here, we show that Eq. © does not prevent any group 
member from reaching its goal. Given that we use the 
contribution value jk- when a coupling group is formed, 
in order to satisfy Eq. ([2ji when decoupling, either robots 
can all execute their original plans or CONVERGENCE must 
return a new plan which progresses robots to their local 
goals. First, their original plans definitelly make progress. 
Second, consider the case when it takes the new plan from 
Convergence. After progressing through the new plan, 
all the robots in the group will reach their local goals. 
Then, the summation of the distance from their current 
locations (which are their local goals) to their final goals 
is smaller than the summation of the distance from their 
locations (where they predicted the conflicts) to their final 
goals plus their contribution values 7 before forming the 
coupling group. Hence, we can conclude that robots would be 
making joint progress to their goals. Hence, the decoupling 
condition Eq. ([5} does not prevent the group members from 
progressing to their final goals. ■ 

V. RESULTS 

In this section, we will show experimental results. First, 
we will show a simulation result. Second, we will provide a 
result of numerical experiments. 

A. Simulation Result 

The simulation shown in Fig. [3] was created using We- 
bots 7.3.0 and the included iRobot Create models. A grid 
environment was modelled which contained 30 iRobots and 
40 obstacles placed at random (but solvable) locations. Each 
iRobot was running with a controller which implemented 
DisCoF+, however, one exception was made. Rather than 
being completely distributed and simulating ad hoc networks 
and localization, the robots communicated with a central 
supervisor which provided this information as well as syn¬ 
chronization for robots involved in a conflict. Robots in 
different outer closures acted completely asynchronously, but 
robots in the same outer closure were synchronized if a 
conflict was detected between any of the member robots. 
Ultimately, this concession will be replaced with simulated 



Fig. 3. Simulation environment represents a 20 x 20 grid world with 
10% wooden box obstacles. In this environment, there are 30 iRobot Create 
finding their path to their goal positions. 

sensors and a fully distributed environment, but for now it 
still provides valuable results. 

The target computer for the simulation was a MacBook 
Pro running Mac OS X 10.10.2 with a 2.3GHz i7 and 
16GB of RAM. The simulation was run two times: once 
with decoupling enabled and once with decoupling disabled. 
Decoupling enabled yielded a total simulation duration of 
3 minutes and 23 seconds. Out of all robots, the maximum 
number of steps required to reach its destination was 40. 
Decoupling disabled yielded a total simulation duration of 
5 minutes and 1 second. Out of all robots, the maximum 
number of steps required to reach its destination was 54. 

These results are interesting for two reasons: one is that 
decoupling enabled performs significantly better, another 
is the ratio of decoupling enabled vs decoupling disabled 
simulation time compared to that of maximum steps. With 
decoupling the simulation took 67% of the time and 74% of 
the steps that decoupling disabled did. The reason for this 
discrepancy is that with decoupling enabled there are more 
stay actions in which a robots action is to stay where it is at. 
Since robots are asynchronous except for when they are in 
a conflict, this means robots will take less time to complete 
a plan with stay actions compared to one that doesn’t. It 
is expected that environments requiring more complex plans 
will benefit from this fact even more. 

We provide the demo video for this simulation. Refer the 
videos to the following url: https ://www. assembla . 
com/spaces/discof/wiki/DisCoF_Plus 

B. Numerical Experiments 

In order to evaluate the improvement of DisCoF + over 
DisCoF, we execute a number of numerical experiments. 
For these experiments, we used a Core i7 CPU 3.2 Ghz 
with 8GB memory and 240 GB SSD in Cygwin environment 
which runs on Windows 8.1. Our prototype implementation 
is written in Python 2.7.2. 

Since we only want to get the total concurrent steps 
and the computation time for these experiments, instead of 
using Webots simulator, we used a simple discrete time 
simulator which does not simulate the phisics of the robots 
or any communication between the robots. Hence, we are 
comparing the total number of steps and the computation 
times between DisCoF and DisCoF + . 

As a result of this implementation, we compute an approx¬ 
imate running time by assuming that each robot move takes 5 
seconds. Then, for each problem instance, the running time is 





DisCoF 

DisCoF+ (DisCoF+/DisCoF) 


COMP. TIME 

STEPS 

APPROX. 

RUN TIME 

COMP. TIME 

STEPS 

APPROX. 

RUN TIME 

OBSTACLES 

AVG 

STD 

AVG 

STD 

AVG 

STD 

AVG 

STD 

AVG 

STD 

AVG 

STD 

5% 

10.064 

8.405 

352.35 

356.207 

1771.815 

1788.861 

10.733 (1.0086) 

22.068 (0.931) 

63.95 (0.4266) 

80.632 (0.356) 

330.483 (0.43) 

423.885 (0.3555) 

10% 

13.19 

10.372 

521.1 

521.24 

2618.69 

2615.82 

14.37 (1.061) 

36.52 (1.538) 

73.51 (0.344) 

108.93 (0.346) 

381.92 (0.348) 

579.065 (0.348) 

15% 

17.6318 

13.296 

653.67 

580.01 

3285.982 

2911.463 

23.92 (1.217) 

49.768 (1.3) 

99.18 (0.294) 

157.356 (0.312) 

519.82 (0.3) 

831.07 (0.314) 

20% 

26.39 

14.009 

954.46 

620.08 

4798.691 

3111.208 

52.391 (1.942) 

75.8 (2.3989) 

175.9192 (0.2427) 

218.859 (0.3132) 

931.987 (0.2535) 

1161.61 (0.3242) 


TABLE I 

Numerical Experiments: Comp. Time represents the total computation time in second, Steps represent concurrent time steps 

FOR ENTIRE ROBOTS’ PLAN, AND APPROX. RUN TIME REPRESENTS APPROXIMATE RUNNING TIME IN SECOND. FOR COMP. TIME, STEPS AND 

Approx. Run Time, DisCoF and DisCoF+ have average and standard deviation. 


equal to the sum of the computation time and the maximum 
number of moving steps for a robot times 5. 

We show that the decoupling approach improves upon 
the previous approach (DisCoF) in. In order to perform the 
experimental analysis, instead of scaling up the number of 
robots, we increase the density of the environment. That 
is, we increase obstacle rates in the environment. Given 
30 robots and 20 x 20 grid environment with their goal 
locations and obstacles, the Table [T| shows the result. The 
obstacle rates changed from 5% to 20%. For this test, we 
randomly generated 100 instances for each obstacle rate. 
Obstacle locations were also randomly chosen. 

The time ratio in Table U indicates that if the environment 
is less populated, then decoupling makes better quality plans 
in terms of the total number of concurrent steps and the total 
computation time of plans. 

However, this result also shows an interesting property of 
DisCoF + . When the environment gets denser, the decoupling 
method does not always reduce the total computation times. 
This is because in dense environments groups that decouple 
may have to recouple with a higher frequency. When it is 
re-coupled, a group should make a new plan which requires 
extra computation time. On average, DisCoF + needed 34.4% 
steps less than DisCoF’s result. 

VI. CONCLUSIONS 

In this paper, we introduced DisCoF + which is an asyn¬ 
chronous extension of our previous work. We also introduced 
a strategy of decoupling in DisCoF + . Through simulations, 
we showed how DisCoF + works in a simulated grid environ¬ 
ment to resolve predictable conflicts in a distributed fashion. 
We also provided numerical experiments to compare DisCoF 
with DisCoF + . In moderately populated environments, the 
decoupling approach shows bettered results than DisCoF. 
In future work, we plan to devise different approaches for 
the decoupling such as more strict decoupling conditions 
and also a heuristic for ordering robots while performing 
PushAndPull so that when at any point time a decoupling 
occurs, the conflicts are minimized. 
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